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Abstract 

Till now, (LA)TeX has been used to generate typeset documents. This paper 
points out an alternative view — the generation of audio renderings. The (IA)T£X 
typesetting source captures a lot of document structure which is then used to 
typeset the document. This can be exploited to convey document structure in 
audio renderings of the document as well. This becomes especially important 
when dealing with mathematical documents. Our approach attempts to develop 
notions of audio formatting similar to the well understood notions of visual for- 
matting and layout. Our goal is to do for the audio document what (IA)TeX has 
done for the printed document. 



Introduction 

Lack of ready access to current mathematical and 
technical literature has been a major stumbling 
block throughout my career. I first became inter- 
ested in trying to use (IA)T£X to overcome this prob- 
lem when I received the EAIfeX source for the lec- 
ture notes in a course on the Design and Analysis 
of Algorithms at Cornell. I use a talking computer, 
since I am visually handicapped, and availability of 
technical documents on-line meant that I could have 
them read by my computer. However I soon realised 
the futility of having the computer read out (lAyifcX 
source directly. The only other effective solution 
available at the time, namely to have the printed 
text read aloud, was clearly inadequate. Of course 
I had the option of trying to get the printed output 
read out using a reading machine, but in the case of 
texts with heavy mathematical content, this contin- 
ues to remain impracticable at present. Current day 
OCR 1 technology is incapable of handling typeset 
mathematics. 

Using the (IA)TlE^ sources as a starting point 
for generating audio renderings was a temptation 
too hard to resist. (IA)T]eX captures a lot of syntac- 
tic and sometimes even semantic information about 
the document content, and this information can be 
used for more than just typesetting the document. 
It could be used equally well in producing high qual- 
ity computer-generated audio renderings of the doc- 
ument. This is especially true when it comes to 

1 Optical Character Recognition 



reading mathematical texts. Complicated mathe- 
matical constructs, which prove a major stumbling 
block for conventional OCR technology, present far 
less of a problem, since the (IA)TeX constructs that 
arc used to produce the final output capture a lot of 
information about what is being laid out on paper. 
Thus this information can be effectively captured at 
the (IA)1]eX source level and exploited in produc- 
ing audio. I therefore started work on a system for 
generating audio renderings of technical documents 
presented in the form of (LA)TfcX mark-up source, 

A first attempt 

I first worked on a program that would trans- 
form (LA)T^X source to a form more suitable to 
be read out by a talking computer. Reading the 
(I^)TeX source directly is impractical, since you 
have to listen to a stream of "backslash" and other 
extraneous utterances. 

This first attempt resulted in the development 
of IfcXT^LK [Ram91], a program that T continue to 
use for my day-to-day work as a graduate student 
at Cornell. This program carries out simple trans- 
formations on the (IA)T£X source and the resulting 
text can be viewed using standard UNIX tools. The 
text which is displayed on the screen can then be 
effectively read out by the talking computer. Thus 
presented with the expression: 

l + vs 

2 

the program transforms the above text to: 



Preprint: Proceedings of the 1991 Annual Meeting 



22 Feb 1993 9:30 



1001 



T. V. Raman 



The fraction with numerator J + square root 
o f 5 and denom inator 2 end of fraction 
This is much more intelligible than the following 
cryptic utterance which would be generated if the 
(1A)T£X source were being directly read, 

dollar dollar backslash frac left brace one plus 
backslash sqri left brace five right brace right 
brace left brace two right brace dollar dollar 
TgXTALK modified (LA)TEX mark-up source 
corresponding to mathematical notation and gener- 
ated text that would parallel what a human reading 
out the printed result would say. Sec the appendix, 
"An example of complex math expressions", for a 
more detailed example. I obtained immediate direct 
access to several text books whose authors kindly 
agreed to make available the on-line sources. Cur- 
rently I also have access to the on-line sources for 
the AMS bulletins. Thus the program makes the 
latest mathematical publications accessible to me. 

This first attempt uncovered a lot of interest- 
ing issues, which then led naturally to the next step, 
namely, developing notions of audio formatting anal- 
ogous to the well understood notions of visual for- 
matting. Though the program as it works is em- 
inently usable, the audio renderings generated arc 
not as effective in conveying the complete struc- 
ture of the document. Sub-expressions occurring in 
a complicated expression make perfect sense when 
handled by the system, but it is still difficult to com- 
prehend extremely complicated mathematical ex- 
pressions, especially in terms of understanding how 
the various sub-expressions interact with one an- 
other. The example in the appendix, which shows 
the text generated for a complex math expression, 
makes these shortcomings explicit. 

A rigorous approach 

The initial implementation revealed the need for de- 
veloping a rigorous approach to audio formatting. 
It also became apparent that in order to do full jus- 
tice to the audio, carrying out transformations based 
on a linear scan of the (I^/IfeX source itself was 
not adequate. Even though (I£)TkX source con- 
tains a lot of useful information about document 
structure, the earlier approach of string substitu- 
tion, i.e., scanning the source and applying simple 
transformations fails to fully exploit all of this in- 
formation. The transformations carried out tended 
to be local in nature as string substitution ignores 
global structure and this was identified as a principal 
cause of the ineffectiveness in conveying global struc- 
ture. The above becomes clear when wo compare the 
readings corresponding to a complicated expression 



generated by a trained and experienced reader with 
that generated by the system. Sec the appendix, 
"Text taken verbatim from the RFB recording' 1 and 
"Transformed text generated by T^XTaLK", for a 
comparison. In fact even a straightforward tran- 
scription of the text as present in the recordings 
from the RFB 2 loses a lot of information. This is be- 
cause the trained reader inserts appropriate pauses 
and uses other prosodic cues to convey the nesting 
of complicated sub-expressions. This shows clearly 
that a system that attempts to read out the text 
resulting from carrying out simple transformations 
to the (I£)T£X source will be unable to convey such 
structural information. The trained RFB reader is 
able to insert appropriate cues into the spoken text 
only after having "parsed" and "re- parsed" the ex- 
pression. This shows a clear need for first construct- 
ing higher level representations of the expression in 
order to improve the quality of the audio rendering. 
Thus there arc two steps to audio rendering: 

• Generate the text to be spoken. 

♦ "audio format" this text, i.e., insert appropriate 
cues into the spoken text in order to convey 
structure. 

A two-step approach. The preceding discussion 
shows that a better approach is to subdivide the 
problem into: 

1. Build a high-level model of the document by 
parsing the (1^)1^ source. 

2. Generate audio renderings by applying appro- 
priate audio formatting rules to the resulting 
structure. 

Advantages. This approach no longer suffers from 
the drawbacks alluded to earlier. This is because 
the audio renderings that arc now driven off a 
high level representation of the document can draw 
upon global knowledge of document structure. Thus 
while sub-expressions continue to be formatted for 
audio by applying transformations as before, the au- 
dio renderings can now capture information about 
how various entities appearing in the document re- 
late to one another. The human reader conveys 
such structural information using prosodic cues. 
When using computer-generated speech, the range 
of prosodic cues that we can use is limited in com- 
parison. However these can be augmented by using 
non-speech audio cues. By non-speech audio cues, 
wc mean short chords of music, beeps etc. that can 
be used to convey n on- textual content. These can 
prove extremely useful, since they can be used to 

2 Recordings for the Blind 
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cue the listener to extra-textual content without in- 
troducing unnecessary verbiage in the audio render- 
ings. In addition, the audio documents generated 
from such high level document structures will be ca- 
pable of allowing the user to browse the document. 

Generating high-level document 
models from (IA)TeK source 

This section details the approach we have taken to 
solving the first of the two subproblcms outlined 
in the previous section. Well-written (I^)TeX doc- 
uments capture document structure using macros 
designed to reflect logical rather than layout struc- 
ture. This fact is heavily relied upon when gener- 
ating high level models given specific instances of 
(I£)T£X documents. This forces certain constraints 
on the type of (I/ l )Te>X documents that such a sys- 
tem will be able to handle. These constraints arc 
derived in the following subsections. We then give 
an approach for generating such high-level models 
for (IA)TeX documents satisfying these constraints. 

Defining high-level models. Wc think of high- 
level models of the documents as being given by ab- 
stract syntax trees corresponding to a specific lan- 
guage £. Different document types satisfy different 
languages, and thus have different high level rep- 
resentations. The abstract models wc define arc 
thus specific to a given class of documents. For the 
present wc will consider the article style of IATfcX 

A hierarchical structure. Documents conform- 
ing to the article style of MfcX have a clear hi- 
erarchical structure. The document divides neatly 
into a simple tree structure where the subtrees cor- 
respond to various structural units such as section, 
subsection, etc. I>TeX documents have this struc- 
ture clearly tagged by the use of standard IATeX 
constructs and this allows us to get at the high level 
structure [Lam86]. Similar structures arc also easy 
to obtain from well written T^X documents where 
the structure is marked up using macros from stan- 
dard packages such as plain T£>Y [Knu84]. How- 
ever, since TeX docs not always insist on the use of 
predefined structures for marking up the document, 
this step can prove difficult when dealing with "raw' 1 
T^X documents. 

Defining a recognizable class ol {IA)TEiX docu- 
ments. T[j>\ as described by Knuth [Knu84, Knu86] 
is a powerful typesetting language and embodies 
many features of a programming language. By pro- 
viding primitives normally found in a programming 
language it affords immense flexibility to the de- 
signer of a document. However, with this flexibil- 



ity come a lot of problems, since such power in the 
hands of an average user can prove dangerous. 
All power corrupts and absolute power cor- 
rupts absolutely! 
This statement is true in the world of T^X docu- 
ments as well. A properly prepared T^X document 
uses the power of the language sparingly and avoids 
mixing typesetting commands with document con- 
tent. Using well-designed formats results in (LA)TfcX 
source that clearly reflects the document structure. 
However the case with which new constructs can be 
defined in T^X means that the above principles arc 
often violated. 

Documents which rely on absolute commands 
like \vskip to achieve document structure by pro- 
viding the right visual effect present serious prob- 
lems to a program that is trying to build a higher 
level model of the document. This is to be ex- 
pected since the use of such absolute commands 
within the text of an electronic document indicates 
an assumption that the electronic source will be 
used only for typesetting the document. Thus the 
first and most important constraint that wc impose 
on the class of (LA)T£X documents that wc handle 
is that document structure be clearly tagged using 
only standard macro packages. Thus our current 
parser recognizes an enumerated list in IATftX which 
is clearly marked up, i.e., each new item is explicitly 
tagged using \item. However if an author chooses to 
mark-up certain items in an enumerated list by us- 
ing \itera and other items by \f oo, where \f oo has 
been defined by the author to generate the right vi- 
sual effects required to signal a new item, the parser 
fails to recognize some of the items in the list. 

Classifying (IA)TeX macros. The level of com- 
plexity present in constructing an abstract model 
of a document given its (lA)T^X source is directly 
determined by the type of macros used by the au- 
thor. Macros can be used to mark-up document 
structure, for laying out specific structures and ob- 
jects, and for achieving visual effects. As pointed out 
above, macros arc also used to augment predefined 
formats. In addition macros arc also widely used to 
make the task of key-boarding easier. This multi- 
ple use of macros causes some difficulty when con- 
structing high level models of a (l^jTgX document. 
In order to define the class of ([A)TftX documents 
wc can handle, wc need to classify T&X macros ac- 
cording to how they arc used. This will allow us to 
clearly define the class of macros that wc can handle 
in our document recognition step and will in effect 
dciinc the class of documents that the system will 
be able to recognize. Further, this classification step 
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will also indicate to what extent we should expand 
macros during the recognition step and how the ab- 
stract model is to capture macro calls. 

In the following, Mu is used to denote the uni- 
versal set of all macros. The various subsets arc 
denoted by appropriate suffixes. 

Graphic. This subset will be denoted by Mo- 
Macros that provide primitive typesetting opera- 
tions such as the \Xern and \hrule macros in TcX 
arc typical examples of such macros. They arc char- 
acterized by their visual nature. Explicit use of such 
macros in a document docs not provide sufficient 
information to allow for the construction of an ab- 
stract model. 

Graphic macros representing standard ob- 
jects. This set of macros will be denoted by Mo . 

The term standard object is used here to re- 
fer to commonly occurring entities such as integrals 
and fractions. These objects have a standard visual 
template according to which they arc typeset, and a 
properly composed document renders these visually 
by using specialized macros that take appropriate 
arguments. Thus by their very nature such macros 
capture a functional representation of the object. 
When confronted by such macros in a document, 
it is unwise to try and expand them any further in 
terms of the lower level control sequences from which 
they have been constructed. Thus the \frac macro 
of IM£X contains all the higher level information we 
can get about the object it renders, and the model 
we build should capture this call. 

Another difficult subset of macros that belong 
to this category arc special symbols built up with 
primitive control sequences. This is typical of com- 
plicated combinations of primitive macro calls that 
arc used to create special visual effects. When build- 
ing the higher level model of the document, we need 
to capture the essence of what is being conveyed by 
the use of such macros, rather than the result of 
expanding the macro call. Thus if \Real has been 
built up using a set of primitive visual macros to 
produce a real number symbol, the abstract model 
should stop by capturing just the macro call \Real 
rather than attempting to expand it any further. 
In fact expanding the call will actually eliminate in- 
formation. 

In an ideal world the principal type of macros 
one would encounter when parsing the typesetting 
source would be elements of Mo> However life is not 
so simple, and often electronic documents abound 
with the use of lower level control sequences. Fur- 
ther .MoHMo — $ docs not always hold. It can 
be argued that the \Real control sequence discussed 



earlier actually belongs to Mg- This would be true 
if we did not know what the author intended to rep- 
resent by the use of the macro, which could often be 
the case. 

The above is also true of the use of TgX con- 
trol sequences like \atop and \over which often re- 
quire some knowledge of the context in which they 
arc used in order to come up with a semantic in- 
terpretation of their use. Thus the use of \atop in 
a nested subscript often means conjunction, while 
it means something entirely different when used in 
rendering a Lcgcndrc symbol. Consider the follow- 
ing example: 3 

aijbjkCki 

Ki<p 
\Zj<q 
l<k<r 

which is produced by: 
$$\sum_{{\3criptstylel\le i\le p 
\atop\scriptstylel\le j\le q} 
\atop\scriptstylel\le k\le r} 
a_{ij}b_{jk}c_{ki}$$ 

Macros for simple text substitution. This class 
of macros will be denoted by Mr- These macros 
arc typically used to make the task of key-boarding 
easier. In most situations it is safe to expand these 
macros since their expansion docs not lead to loss of 
structural information. However, once again Mo ^ 
Mt — 0 is not always true. 

Consider the use of the macro \reader in the 
following sentence. 

"We arc working on a \reader for electronic 
documents. " 
where \reader has been defined elsewhere as 
\def \reader{new exciting reading machine} 

This macro clearly belongs to Mr in the context 
in which it is described. Expanding it directly will 
lead to the text "We arc working on a new exciting 
reading machine for electronic documents." This 
can now be easily rendered in audio. 

However, viewed from a different perspective, 
i.e., the use of \reader as a logo, this macro could 
well be said to be in Mo- In the context of au- 
dio formatting, wc may wish to render the result of 
the call to the \reader macro differently. 

Recognizing the type of a macro. Recogniz- 
ing the class to which a given macro belongs is a 
hard problem. In general no universal classification 

3 Taken from The T^Kbook, Chapter 17. 
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will always hold, as is clear from the previous de- 
scription. However we can use some simple heuris- 
tics to classify a major subset of the commonly oc- 
curring macros. Clearly all the primitive typeset- 
ting operations provided by TgX in terms of putting 
marks on paper belong to Mg- Further macros that 
arc known not to be in Mo hut take arguments arc 
typically in Mo, since the use of macro arguments 
normally indicates that a template is being filled up. 
Macros that which do not fall into cither category 
now fall into Mr- Given more information about 
the context in which the macro is being used, it 
becomes possible to further refine the above classi- 
fication. 

Summary of constraints. To summarize, here 
arc the constraints we need to impose on the class 
of (IA)TeX documents we can handle: 

1. Document structure should be clearly marked 
up. 

2. Explicit use of absolute commands should be 
avoided. 

3. Use of macros should reflect semantics and log- 
ical structure rather than physical layout. 

These constraints have been introduced while dis- 
cussing the global document structure. However 
they hold equally well when considering specific 
components of a document, such as mathematical 
expressions occurring within the document. 

Format independent techniques of informa- 
tion capture. The above discussion also reveals 
the need for developing a framework for represent- 
ing information in electronic documents indepen- 
dent of any single "display" method. IfeX goes a 
long way in achieving this for mathematical docu- 
ments. However since ail the primitive operators 
used to achieve this arc also available to the av- 
erage user, T£X documents do not always conform 
to the constraint that information in a document 
be represented independent of formatting details. 
As these concepts become better understood, we 
need to work towards the development of a language 
for representing structured information in electronic 
documents. Some of this has already been achieved 
by SGML 4 [SGM86, Org90] and there is a clear need 
to carry over this work to cover mathematical doc- 
uments as well. Development of such format inde- 
pendent techniques of information capture will allow 
us to provide alternative methods of accessing the 
same information structures. 

A Standard Generalized Mark-up Language 



Reading mathematics aloud 

The previous section pointed out the need for high- 
level information capture independent of specific for- 
matting techniques. Given high level information 
structures, we need to develop adequate techniques 
for accessing this information using alternative per- 
ceptual modalities such as audio. This section ad- 
dresses the various questions that arise in develop- 
ing an effective notation al system for mathematics 
in the audio world. In order to do this, we first ana- 
lyze how visual notation works and attempt to apply 
some of what we learn to the audio world. 

Features of visual math notation. Traditional 
math notation fully exploits the two-dimensional na- 
ture of the visual tablet. It is thcrcforc"not linear" 
but achieves conciseness by using subscripts and su- 
perscripts. It relics on the eye's ability to move 
quickly across the paper, and uses visual cues like 
delimiters of different sizes to cue these structured 
movements. Written mathematics is not "linear" in 
two different senses of the term, i.e., "space" and 
"time". 

1, It uses a two-dimensional display. 

2. It is not linear in time since the eye is able to 
move back and forth across the paper seemingly 
at will. 

Both of these features arc absent in traditional spo- 
ken mathematics. Spoken mathematics on tape is 
linear in time. In addition, it tends to be verbose 
since there is no standard way of alerting the lis- 
tener to complicated syntactic constructions other 
than describing these verbosely (sec Chang [Cha83], 
which is used as a guideline by RPB for reading math- 
ematical texts). This can be directly attributed to 
the apparent lack in audio of the two-dimensional 
nature of the printed medium. This lack of two- 
dimensionality is overcome by the reader inserting 
words like "open quantity" and "raised to the quan- 
tity" in order to cue the listener to the presence of 
complex constructions. Use of such cues, though 
effective, causes the audio rendering to be necessar- 
ily verbose, making it difficult to grasp the essence 
of what is being conveyed. Mathematical notation 
relics on conciseness to express high- level concepts, 
and as the complexity of the expressions being han- 
dled increases, the resulting verbiage in the spoken 
equivalent of the written expression renders it. prac- 
tically unusable, Wc need adequate audio substi- 
tutes for these features of mathematical notation 
if wc are to have any hope of effectively conveying 
mathematics using audio. 
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Spoken mathematics. The previous paragraph 
takes care to speak of mathematics in audio rather 
than merely referring to "spoken mathematics". 
This choice of terminology is intentional. One of the 
ways traditional spoken mathematics conveys com- 
plex interrelationships between sub-expressions is 
to use prosodic cues within the speech. Sec the ap- 
pendix, "Text taken verbatim from RFB's record- 
ing", for an explicit example. In order to achieve 
the full expressiveness of a human reading mathe- 
matics we need to use a lot of prosodic cues in the 
speech. Though computers of today can talk in- 
telligibly they arc still a long way from achieving 
this degree of expressiveness. We therefore need to 
augment the computer's speech ability by a well de- 
signed system of audio cues which can then be used 
to better convey extra-textual content when reading 
complicated expressions. 

Multiple channels of audio. Non-speech audio 
cues can be synchronized with the speech output. 
Thus by using multiple channels of audio output, 
i.e., by having both speech and non-speech audio 
cues playing at the same time, and exploiting direc- 
tionality of sound, we can offset the disadvantages 
resulting from not having a two-dimensional display. 
In fact the audio document is not restricted to a Hat 
display, and proper use of audio cues can result in 
effective communication of complex constructs. 

Browsing in an audio document. The previous 
subsection addresses the problems resulting from 
traditional audio documents being linear in space. 
This subsection in turn addresses the problems re- 
sulting from the fact that conventional recordings 
on tape have been linear in time. 

A serious difficulty faced when listening to the 
recording of a complex expression is that the listener 
is forced to retain the entire expression. Thus com- 
prehending spoken mathematics demands a longer 
attention span. In fact often it becomes impossible 
to remember the beginning of a complex expression 
by the time one has reached its end. This is clearly 
evinced by the reading of Faa dc Bruno's formula 
presented in the appendix of this paper. Visual no- 
tation, by using different sized delimiters, different 
levels of subscripts, etc., cues these structured move- 
ments, and thereby allows the eye to move around 
the printed expression. Analogously wc need to al- 
low the listener to move around the expression and 
access parts of it at will. This will obviate the need 
for the listener to retain the entire expression in 
memory. 

These structured movements can be performed 
using the high-level model for the expression that 



has been constructed at the recognition step. This 
is one of the major advantages of first recognizing 
the structure before generating audio renderings. 

Conclusion. 

This paper describes an audio view of (iA)TgX doc- 
uments. Electronic typesetting source can be used 
to generate audio documents as well. In order to 
do this effectively, wc need to recognize document 
structure from the electronic source. Audio render- 
ings will then be driven from this high level struc- 
ture. There is a need to develop notions of audio for- 
matting analogous to the well understood notions of 
visual formatting. Finally wc need to better under- 
stand how visual browsing works in order to build 
into the system the ability to provide the same func- 
tionality in the audio setting. 
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Appendix: An example of complex math expressions 

This appendix gives the TEXsourcc for a complicated mathematical expression, the transformed text gener- 
ated by Te>XTaLK, and finally the text as read on the RFB recording of the same expression. 

Note: This piece of mathematical text is taken from Art of Programming, Vol. 1, 2nd Edition, Section 
1.2.5, p. 50, Ex. 21 by Donald E. knuth. 
The TEiXsource 

{\bf 21.} {\em [HM25]} (Faa di Bruno's formula.) 

Let $D~k_x u$ represent the $k$th derivative of a function $u$ with 
respect to $x$. The "chain rule" states that $D~l_xw = D~i_u w 
D~l_x u$. If we apply this to second derivatives, we find $D~2_xw = 
D~2_u w (D~l_x u)~2+D"l_u w D"2_x u$. 
Show that the {\em general formula\/> is 
$$D~n_xw = 
\sum_{0\le j\le n> 

\3um_{\ s cr ipt s t y le k_ l+k_2+\cdot s+k_n= j 

\atop {\scriptstyle k_l+2k_2+\cdots+nk_n=n 

\atop {\scriptstyle k_l ,k_2 , \ldots,k_n\geO >}> 

D~j_u w \frac{n!Hk_l!-C(l!)} A {k_l} \cdots k_n!{(nf )}~{k_n}} 

{(D*l.x u)}~{k_l} \cdots {(D^n.x u)}~{k_n}.$$ 

The above piece of TfcXcodc was keyed in while listening to the reading of the expression on the RFB 
tape. This shows up an interesting fact about ordering of subscripts and superscripts. In this case the reader 
has read the superscript first, and the TEXsourcc reflects this in that it uses D~l„x rather than the more 
standard D_x~l as recommended in The Tf$ibook. At this time, there seems to be no reason to choose the 
reader's order of speaking the subscript after the superscript in the general case as against the order one 
would use if scanning the standard TfcXusagc linearly. In the case of Faa Dc Bruno's formula, the reader 
has used his interpretation of D l x in making this choice. In the interest of being able to easily parse the 
T£Xsourcc, we need to stick to cither one of the two ordcrings when writing TcXdocumcnts. The order in 
which the subscripts and superscripts arc eventually read can then be decided at a later stage in the audio 
formatting. 

Text taken verbatim from RFB's recording 

Exercise 21has a rating of cap h cap m 25 
Faa dc Bruno's formula. 

Let D super k sub x of u represent the kth derivative of a function u with respect to x. The chain rule states 
that cap d super 1 sub x of w equals cap d super 1 sub u of w cap d super 1 sub x of u. If we apply this to 
second derivatives, wc find that cap d super 2 sub x of w equals cap d super 2 sub u of w times the quantity 
cap d super 1 sub x of u quantity squared plus cap d super 1 sub u of w times cap d super 2 sub x of u. 
Show that the general formula is cap d super n sub x of w equals the summation from 0 less than or equal 
j less than or equal n of the summation over k sub 1 plus k sub 2 plus and so forth plus k sub n equals j, k 
sub 1 plus 2 k sub 2 plus and so forth plus n k sub n equals n, k sub 1 comma k sub 2 comma and so forth 
comma k sub n greater than or equal to zero The quantity being summed is 

cap d super j sub u of w times the fraction n factorial over k sub one factorial times one factorial raised 
to the k sub 1 times and so forth times k sub n factorial times n factorial raised to the k sub n the entire 
fraction times 

the quantity cap d super 1 sub x of u closed quantity raised to the k sub 1 times and so forth times the 
quantity cap d super n sub x of u closed quantity raised to the k sub n 

Transformed text generated by TgXT\LK 
21. [HM25] (Faa di Bruno's formula.) 

Let D super k sub x u represent the k th derivative of a function u with respect to x . The "chain rule" 
states that D super 1 sub xw = D super 1 sub u w D super 1 sub x u . If wc apply this to second derivatives, 
wc find D super 2 sub x w = D super 2 sub u w ( D super 1 sub x u ) super 2 -f D super 1 sub u w D 
super 2 sub x u . Show that the general formula is 

D super n sub x w = sum sub 0 less than or equals j less than or equals n sum sub k sub 1 -f k sub 2 -f 
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ellipses + k sub n = j and below that k sub 1 + 2 k sub 2 + ellipses + n k sub n = n and below that k sub 1 
, k sub 2 , and so on , k sub n greater than or equals 0 D super j sub u vv fraction with numerator n factorial 
and denominator k sub 1 factorial ( 1 factorial ) super k sub 1 ellipses k sub n factorial ( n factorial ) super 
k sub n end of fraction ( D super 1 sub x u ) super k sub 1 ellipses ( D super n sub x u ) super k sub n . 

The resulting formatted output 

21. [HMS5] (Faa di Bruno's formula.) 

Let D%u represent the Ath derivative of a function u with respect to x. The "chain rule" states that 
D x w - d[wD t u. If we apply this to second derivatives, wc rind D 2 x w = Dlw(D x x u) 2 + D x u xvDlu. Show 
that the general formula is 



Diw 




0<j<n k 1 +k- 2 +-+k n =j 
ki+2k?+-+nk n =r> 
ki,k 2 ,...,k„>0 
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